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Protein molecules are organized in a structural hierarchy 
The Danish biochemist Kai Linderstrem-Lang coined the terms primary sec 

are s 31 d to be homologous to each other. Since the cor esoondinr S 

Secondary structure occurs mainlv as a helices and 6 strand rZ f 
quences favor either a helices or 3 strands; othe s'fa^or orZZ nfl *" 

ad,acent a helices or p strands dose to each other' ? ° m 

* r . S 7Tl m ° t5fS USU3lly combine "> form compact globular structures whirh 
are called domains. In this book we will use tertiarv J™* mCtures ' K hich 

Many protein molecules have only one rh^n- 

ture. These subunits can function either InS^^^T^T J™* 

cooperatively so rhatthpf,,^- c Hiuepenaently of each other or 
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different SSd£LC I* ™ ™ M ^ several 
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Large polypeptide chains fold into several domains 

example ^ tSeSdfren^ 6111 aSS ° dated With different functi °ns. Fo 
^MftfN^us^^f^' d «f^ in Cha Pter 7, there is one 

C-terminaldom .^^tS^*^ in * at ^ DNAandas «^ 
repressor molecule Polypeptide chains together into a dimeric 
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Figure 2.19 Organization of polypeptide chains into domains. Small 
protein molecules like the epidermal growth factor, EGF, comprise only 
one domain. Others like the serine proteinase chymotrypsin are arranged 
in two domains that are both required to form a functional unit 
(Chapter 15). Many of the proteins that are involved in blood 
coagulation and fibrinolysis, such as urokinase, factor IX, and 
plasminogen have long polypeptide chains that comprise different 
combinations of domains homologous to EGF and serine proteinases 
and, in addition, calcium-binding domains and Kringle domains. 

Domains that arc homologous to the epidermal growth factor, EGF, 
which is a small polypeptide chain of 53 amino acids; 

Serine proteinase domains that are homologous to chymotrypsin, which 
has about 245 amino acids arranged in two domains; 

Kringle domains that have a characteristic pattern of three internal 
disulphide bridges within a region of about 85 amino acid residues; 
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Calcium-binding domain (see Figure 2.13}. 



Domains are built from structural motifs 

Domains are formed by different combinations of secondary structure elements 
and motifs. The a helices and p strands of the motifs are adjacent to each other 
in the three-dimensional structure and connected by loop regions. Sequentially 
adjacent motifs, in other words, motifs that are formed from consecutive 
regions of the primary structure of a polypeptide chain, are usually close 
together in the three-dimensional structure (Figure 2.20). Thus to a first 
approximation a polypeptide chain can be sequentially arranged in a number 
of these simple motifs. The number of such combinations found in proteins is 
limited, and some combinations seem to be structurally favored. Thus similar 
domain structures frequently occur in different proteins with different functions 
and with completely different amino acid sequences. 




Figure 2.20 Motifs that are adjacent in the 
amino acid sequence are also usually adjacent 
in the three-dimensional structure. Triose 
phosphate isomerase (Figure 2.10) is built up 
from four fJ-a-(5-<x motifs that are consecutive 
both in the amino acid sequence (a) and in the 
three-dimensional structure (b). 
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Simple motifs combine to form complex motifs 
combined to m § a ke , mor 7co m ^x ToT """^ b> ' ' ! °° P ^ can * 

™;ss n s in n r 2 - 2ia - Tte — ■» 

transcarbamylase (Figure 2.11a) ' ° °" e domain of aspartate 

There are 1 1 additional wavs to build un a R <h» ot „ f ( 
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occurs 1 1 times out of a total number S» arrangement ,n Figure 2.21a 
P strands. In addition, arrangemen 2 2H and ^ U " fra S me f 5 adjacent 
are the Greek kev motifs previa Jiv Hit ?t ° CCUr 6 times each - ™ese 
arrangements do not £ S nf°f ° f the the0r «-»>' Possible 
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figure 2.21 Two sequentially adjacent hairpin 
mo,.fe can be arranged in 12 different wavs in 

of thT ° f f ° Ur StfandS - 131 T °P° l0 S- diagram 
for the arrangemen. when sequentially adjacent 
6 strands are adjacent in the structure.' <b-i, 

i are the Greek key motifs. The topologv 
diagram in j is the mirror image of that'in 

thaMn F T di3Sram in 1 !S the ™™ « 
e^rJ IT 6 bW With the chain direction 
reversed. These four diagrams represent the 
Greek key motif. 



Protein structures can be divided into three main classes 
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^'gure 2.15). These structures can be considered 
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to belong to a small fourth group. In addition to these groups, there are a number 
of small proteins that are rich in disulfide bonds or metals and form a special 
group. The structures of these proteins seem to be strongly influenced by the 
presence of these metals or disulfides and often look like distorted versions of 
more regular proteins. 

The domains of the known protein structures are classified according to 
Levitt and Chothia's scheme in this book. The three main classes a, P, and a/p 
will be examined in more detail in Chapters 3-5. 



Conclusion 

The interiors of protein molecules contain mainly hydrophobic side chains. The 
main chain in the interior is arranged in secondary stuctures to neutralize its 
■ polar atoms through hydrogen bonds. There are two main types of secondary 
structure, a helices and p sheets, p sheets can have their strands parallel, an- 
" tiparallel, or mixed. 

Protein structures are built up by combinations of secondary structural 
elements; a helices, and p strands. These form the core regions™the interior of 
the molecule— and they are connected by loop regions at the surface. Schematic 
and simple topological diagrams where these secondary structure elements are 
-'; highlighted are very useful and are frequently used, a helices or P strands that 
c: are adjacent in the amino acid sequence are also usually adjacent in the three- 
; dimensional structure. Certain combinations are especially frequent and are 
..^called motifs, for example, the helix-loop-helix motif and the hairpin motif. 
■^iSlwo helix-ioop-helix motifs, each with its own specific geometry and amino 
-iS-'add sequence requirements, are used in many different proteins, one motif for 
.j||>DNA binding and one for calcium binding. 

-v^^^The P-a-p motif, which consists of two parallel P strands joined by an a 
■M occun in almost all structures that have a parallel P sheet. Four antiparallel 
;^|Srands, arranged in a specific way, the Greek key motif, are frequently found 
^^^Tstructures with antiparallel P sheets. 

^Polypeptide chains are folded into one or several discrete units, domains, 
hich are the fundamental functional and three-dimensional structural units. 
Scores of domains are built up from combinations of small motifs of 
Sadary structure, such as a-loop-a, P-loop-P or P-a-P motifs. Domains are 
sifted into three main structural groups: a structures, where the core is built 
Jg^usively from a helices; P structures, which comprise antiparallel p sheets; 
structures, where combinations of p-a-P motifs form a predominantly 
ilel p sheet surrounded by a helices. 
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